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5 NONCLONING TECHNIQUE FOR EXPRESSING 

A GENE OF INTEREST 

Technical Field 

■ The present invention relates generally to 
10 . genetic engineering and recombinant protein production 
techniques. In particular, the invention relates to a 
noncloning method for expressing a gene of interest in 
a mammalian host cell. 

15 Background of the Invention 

Recombinant technology provides an 
attractive method for producing proteins of high 
purity in large quantities. Conventional techniques 
for recombinant ly producing proteins in eukaryotes 

2 0 require that the gene of interest be first inserted 
into an expression plasmid, in vitro, prior to 
transf eetion of a host cell . The expression plasmid 
includes a promoter, which allows RNA polymerase to 
specifically bind to the DNA sequence in order to. 

25 initiate transcription. A polyadenylation signal may 
also be present. A selectable marker is used to allow 
the identification of host cells which express the 
gene of interest. 

Standard techniques for transfecting host 

30 cells require the use of two separate expression 

cassettes, one bearing the gene of interest driven by 
an appropriate promoter, and the other including a 
selectable marker driven by a separate promoter. 
These cassettes may be present in a single plasmid or 

35 may be delivered to the host cell using separate 
vectors. However, such techniques may result in 
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reduced recoveries of clones containing the gene of 
interest due to deletion or inactivation of the 
cassette expressing the same. Alternatively, if 
separate vectors are used, constructs including only 
5 the selectable marker and not the gene of interest, 
may be stably integrated into the host cell genome* 
Thus, background effects may be seen caused by false 
positive clones carrying only the marker plasmid and 
not the gene of interest . 

10 More recently, expression constructs have 

been developed which include one or more selectable 
markers, in addition to. the gene of interest, under 
the control of a single promoter. Such constructs are 
referred to in the art as Micistronic" or 

15 "bicistronic . " For example, constructs have been 
developed which include an internal ribosome entry 
site known as "IRES" derived from, e.g., the 
encephalomyocarditis virus (EMCV) . The IRES element 
permits the translation of two or more open reading 

20 frames from a single messenger RNA, one encoding the 

recombinant protein of interest and the other encoding 
the selectable marker. See, e.g., Kaufman et al . , 
Nuc. Acids Res. (1991) 19:4485-4490; Gurtu et al . , 
Biochem. Biophys. Res. Comm. (1996) 229 -.295-298; Rees 

25 et al . , BioTechniques (1996) 2^:102-110; Kobayashi et 
all,. BioTechniques (1996) 21:399-402; and Moss er et 
al.., BioTechniques (1997 22. 150-161. 

Although the use of a single expression 
cassette decreases the number of false positives, the 

30 technique requires that the gene of interest be first 
subcloned into an expression cassette including a 
promoter, prior to use. /If the expression of multiple 
genes is desired, such as from a cDNA library, the 
- -technique is particularly cumbersome, time- -consuming _ 

3 5 and labor-intensive. Accordingly, there is a 
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continued need for improved methods for producing 
proteins recombinant ly. 

Penolazzi et al . , Anal. Biochem. (1997) 
248 : 190-193 , describe the direct transfection of PCR- 
5 generated DNA fragments, labeled with ethididum 

bromide (EtdBr.) , into mammalian cells. The cells were 
reported to retain the EtdBr -DNA for 48 hours. 

Escher, D, and Schaffner, W. , BioTechniques 
(1996) 21:848-854, describe a system where a 
10 linearized recipient vector containing a yeast GAL4. 
DNA binding domain (DBD) under the control of a CMV 
promoter, and an SV40 origin of replication, is 
cotransf ected into a recipient mammalian cell with a 
fragmented DNA including a potential activation 
15 domain. The technique is said by the authors to 

produce functional fusion proteins- by in vivo ligation 
of the components. . 

However, none of the aforementioned 
references describes a noncloning system whereby a 
20 promoter element, a gene of interest, and a selectable 
marker, are- individually cotransf ected into a mammalian 
• cell to result in expression of the protein encoded by 
the- gene of interest. 

2 5 Disclosure of the Invention 

Accordingly, the present invention is based 
on the discovery of a noncloning method for producing 
a recombinant protein. The system of the invention 
eliminates the need to subclone the gene of interest- 

3 0 into an expression cassette prior to use and therefore 
provides a desirable method for producing a 
recombinant protein and a particularly convenient 
method for expressing several recombinant proteins, 
e.g., from a cDNA library. Moreover, the noncloning 

3 5 technique described herein allows for the direct use 

of a PCR product (either known or unknown) , the direct 
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use of synthetic DNA, as well as the direct use of DNA 
derived from a DNA virus or other organism. Thus, the 
method provides for rapid expression of a gene of . 
interest in mammalian cells. 
5 In addition, the system is designed such 

that expression of one or more selectable marker genes 
is under the control of the same promoter that drives 
expression of the gene of interest. Thus, the gene of 
interest and selectable marker are transcribed as a 

10 dicistronic message, reducing the problem of false 
positive clones, described above. 

In particular embodiments, the present 
invention also advantageously makes use of an 
attenuated marker gene which confers reduced 

15 resistance to a toxic molecule. Greater amounts of 
the selectable marker must therefore be expressed in 
order to overcome the weaker resistance conferred by 
the gene. Hence, cells selected based on expression 
of the marker may inherently contain higher levels of 

2 0 the recombinant product. 

Additionally, the system of the present 
invention is designed such that an IRES sequence is 
present upstream of the selectable marker gene. One 
advantage of structuring the system in this way is 
25 that those sequences downstream of the IRES elements, 
i.e., the selectable marker genes, are expressed less 
vigorously than the gene of interest which is under 
the control of the promoter element. Again, cells 
which are selected based on expression of the 

3 0 selectable marker gene will inherently be higher 

expressors of the polypeptide of interest. 

Accordingly, in one embodiment, the 
invention is directed to a method of expressing a 
recombinant polypeptide comprising: 
35 (a> providing as separate elements (i) a 

first nucleic acid element which comprises a promoter, 
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(ii) a second nucleic acid element which comprises at 
least one selectable marker gene conferring a' 
selectable phenotype on a cell transfected therewith, 
an internal ribosome entry site (IRES) sequence 
5 positioned upstream of the selectable marker gene, and 
a transcription termination sequence positioned 
downstream of the selectable marker gene, and (iii) a 
third nucleic acid element which comprises a gene 
encoding said polypeptide ; 
10 (b) cotransf ecting a population of mammalian . 

cells with each of said first, second and third 
elements; 

(c) culturing said population of cells under 
conditions whereby the polypeptide and the at least 
15 one selectable marker are expressed; 

' .(d) selecting cells which express the 
selectable marker; and 

(e) identifying those cells from the 
selected cells that express the recombinant 
20 polypeptide. 

In particularly preferred embodiments, the 
selectable marker gene is selected from the group 
consisting of a neo r gene and a dhfr gene and the IRES 
sequence is derived from an encephalomyocarditis virus 
25 (EMCV) . . 

In still a further embodiment, the invention 
is directed to a selectable marker nucleic acid 
element comprising: (a) a neo r gene, (b) a dhfr gene 
positioned upstream of the neo r gene, wherein the stop 
3 0 codon for the dhfr gene is replaced with a spacer of 
about 3 to about 150 nucleotides, such that 
transcription and translation of both the dhfr and neo r 
genes occurs, (c) an internal ribosome entry site 
(IRES) sequence positioned upstream of the dhfr gene, 
3 5 (d) and a transcription termination sequence 
positioned downstream of the neo r gene. 
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In another embodiment, the invention is 
directed to a selectable marker nucleic acid element 
comprising: (a) a neo r gene which confers reduced 
resistance to geneticin on a cell transfected 
5. therewith as compared to the wild- type neo r gene, (b) a 
dhfr gene positioned upstream of the neo r gene, wherein 
the stop codon for the dhfr gene is replaced with a 
spacer of about 12 to about 18 nucleotides, such that 
transcription and translation of both the dhfr and neo r 

10 genes occurs, (c) an encephalomyocarditis virus 
internal ribosome- entry site (IRES) sequence 
positioned upstream of the dhfr gene, (d) and a bovine 
growth hormone transcription termination sequence 
positioned downstream of the neo r gene. 

15 In yet further embodiments, the subject 

invention is directed to mammalian host cells 
transfected with the above selectable marker nucleic 
acid elements. 

These and other embodiments of the present 

2 0 invention will readily occur to those of ordinary 

skill in the art in view of the disclosure herein. 

Brief Description of the Figures 

Figure 1 depicts plasmids pCMVII and pET-E- 
25 DHFR/Neo (m2) BGHt , used as sources of the CMV promoter 
element and selectable marker elements, respectively, 
as described in the examples. 

Figure 2 shows one strategy used to effect 
expression of a gene of interest in a mammalian host 

3 0 cell using the system of the present invention. A CMV 

promoter element, selectable marker cassette, as well 
as a nucleic acid element comprising a gene encoding 
an hepatitis B virus (HBV) preS2 polypeptide (termed 
"G.O.I." (gene of interest) ) is shown in the figure. _ 
3 5 Figure 3 shows sAg titers of from cells 

transfected as shown in Figure 2, prior to 
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. methotrexate (Mtx) . amplification in dhfr selective 
medium. The arrows indicate clones subsequently 
subjected to Mtx selection,, as described in the 
examples. 

5 Figures 4A and 4B depict amplification 

results, for .subclones T21.18 (Figure 4A) and T21.50 
(Figure 4B) , as described in the examples. 

Figure 5 shows sAg titers from subclones 
derived from the.T21.18 parental clone following 
10 extended passaging, as described in the examples •■ The 
degree of amplification over the parental clones is 
also shown. 

Figure 6 shows the results of an 
immunoprecipitation experiment using rabbit anti-HBV 
15 sAg. Labelled cell lysates or culture medium from 
subclone T21. 18 . 25 cells (lanes 2 and 4) or control 
CHOdg44 cells (lanes 1 and 3) were immunoprecipitated 
as described in the examples.* 

20 Detailed Description of the Invention 

The practice of the present invention will 
employ, unless otherwise indicated, conventional 
methods of virology, microbiology, molecular biology, 
recombinant DNA techniques and immunology within the 

25 skill of the art. Such techniques are explained fully - 
in the literature. See, e.g., Sambrook, et al . , 
Molecular Cloning: A Laboratory Manual (2nd Edition, 
1989); DNA Cloning: A Practical Approach , vol . I & II 
(D. Glover, ed.); Oligonucleotide Synthesis (N. Gait, 

30 ed., 1984) ; A Practical Guide to Molecular Cloning 

(1984); Fundamental Virology, 2nd Edition, vol. I & II 
(B.N. Fields and D.M. Knipe, eds . ) ; and Methods In 
Enzymology (S. Colowick and N. Kaplan, eds., Academic 
Press , Inc . ) . 

35 As used in this specification and the 

appended claims,, the singular forms "a," "an" and 
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"the" include plural references unless the content 
clearly dictates otherwise. 



I. Definitions 

In describing the present invention, the 
following terms will be employed, and are intended to 
be defined as indicated below. 

The terms "polypeptide 11 and "protein" refer 
to a polymer of amino acid residues and are not 
limited to a minimum length of the product. Thus, 
peptides, oligopeptides, dimers, multimers, and the 
like, are included within the definition. Both full- 
length proteins and fragments thereof are encompassed 
by the definition. The terms also include 
postexpressibn modifications of the polypeptide, for 
example, glycosylation, acetylation, phosphorylation 
and the like . 

For purposes of the present invention, the 
polypeptide expressed by the gene of interest may be 
one useful in a vaccine, therapeutic or diagnostic and 
may be derived from any of several known viruses, 
bacteria, parasites and fungi, as well as any of the 
various tumor antigens. Alternatively, the expressed 
polypeptide may be a therapeutic hormone, a 
transcription or translation mediator, an enzyme, an 
intermediate in a metabolic pathway, an 
immunomodulator , and the like. 

Furthermore, for purposes of the present 
invention, a "polypeptide" refers to a protein which 
includes modifications, such as deletions , additions 
and substitutions (generally conservative in nature) , 
to the native sequence, so long as the protein 
maintains the desired activity. These modifications 
may be deliberate, as through site-directed 
mutagenesis, or may be accidental, such as through 
mutations of hosts which produce the proteins or 
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errors due to PGR amplification. A "coding 

sequence" or a sequence which "encodes" a selected 
polypeptide, is. a.. nucleic acid molecule which is 
transcribed (in the case of DNA) and translated (in 
5 the case of mRNA) into a polypeptide in vivo when 
placed under the control of appropriate regulatory 
sequences. The boundaries of the coding sequence are 
determined by a start codon at the 5' (amino) terminus 
and a translation stop codon at the 3' (carboxy) 

10 terminus. A coding sequence can include, but is not 

limited to, cDNA from viral, procaryotic or eucaryotic 
mRNA, genomic DNA sequences from viral (e.g. DNA 
viruses and retroviruses) or procaryotic, DNA, and 
especially synthetic DNA sequences. A transcription 

15 termination sequence may be located 3' to the coding 
sequence 

A "nucleic acid" molecule can include both 
double- and single-stranded sequences and refers to, 
but is not limited, to, cDNA from viral , procaryotic or 
20 eucaryotic mRNA, genomic DNA sequences from viral 

(e.g. DNA viruses and retroviruses) or procaryotic : 
DNA, and especially synthetic DNA sequences. The term 
also captures sequences that include any of the known 
base .analogs of DNA and RNA. 

2 5 "Operably linked" refers to an arrangement 

of elements wherein the components so described are 
configured so as to perform their desired function. 
Thus, a given promoter operably. linked to a coding 
sequence is capable of effecting the expression of the 

3 0. coding sequence when the proper transcription factors, 

. etc., are present. The promoter need not be 
contiguous with the coding sequence, so long as it 
functions to direct the expression thereof. Thus, for 
example, intervening untranslated yet transcribed 
3 5 sequences can be present between the promoter sequence 
and the coding sequence, as can translated introns, 
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and the promoter sequence can still be considered 
"operably linked" to the' coding sequence. 

"Recombinant " as used herein to describe a 
nucleic , acid molecule means a polynucleotide of 
5 genomic, cDNA, viral, semisynthetic , or synthetic 

origin which> by virtue of its origin or manipulation 
is not associated with all or a portion of the 
polynucleotide with which it is associated in nature. 
The term "recombinant" as used with respect to a 

10 protein or polypeptide means a polypeptide produced by 
expression of a recombinant polynucleotide. 

A "control element" refers to a 
polynucleotide sequence which aids in the expression 
of a coding sequence to which it is linked. The term 

15 includes promoters, transcription termination 
sequences, upstream regulatory domains , 
. polyadenylation signals, and when appropriate, leader 
sequences and enhancers, which collectively provide 
for the transcription and translation of a coding 

2 0 sequence in a host cell. 

A "promoter" as used herein is a DNA 
regulatory region capable of binding RNA polymerase in 
a mammalian cell and initiating transcription of a 
downstream (3 ' direction) coding sequence operably 

25 linked thereto. For purposes of the present 

invention, a promoter sequence includes the minimum 
number of bases or elements necessary to initiate 
transcription of a gene of interest at levels 
detectable above background. Within the promoter 

30 sequence is a transcription initiation site, as well 
as protein binding domains (consensus sequences) 
responsible for the binding of RNA polymerase. 
Eucaryotic promoters will often, but not always, 
contain- "TATA"- -boxes -and "GAT" boxes . 

35 By "selectable marker" is meant k gene which 

confers a phenotype on a cell expressing the marker, 
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such that the cell can be identified under appropriate 
conditions. Generally, a selectable marker allows 
selection of transected cells based on their ability 
to thrive in the presence or absence of a chemical or 
5 other agent that inhibits an essential cell function. 
Suitable markers, therefore, include, genes coding for 
proteins which confer, drug resistance or sensitivity 
thereto, impart color to, or change the antigenic 
characteristics of those cells transfected with' a 

10 nucleic acid element containing the selectable marker 
when the cells are grown in an appropriate selective 
medium. For example, selectable markers include: 
cytotoxic markers and drug resistance markers, whereby 
cells are selected by, their ability to grow on media 

15 containing one or more of the cytotoxins or drugs; 
auxotrophic markers by which cells are selected by 
their ability to grow on defined media with or without 
particular nutrients or supplements, such as thymidine 
and hypoxanthine ; metabolic markers by which cells are 

20 selected for, e.g., their ability to grow on defined 
media containing the appropriate sugar as the sole 
carbon source, or markers which confer the ability of 
cells to form colored colonies on chromogenic 
substrates or cause cells to fluoresce. 

25 Representative selectable markers are described in 
more detail below. 

"Transformation, 11 as used herein, refers to 
the insertion of an exogenous polynucleotide into a 
host cell, irrespective of the method used for 

30 insertion: for example, transformation by direct 

uptake, transf ection, infection, and the like. For 
particular methods of transf ection, see further below. 
The exogenous , polynucleotide may be maintained as a 
nonintegrated vector, for example, an episome, or 

35 alternatively, may be integrated into the host genome . 
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By "cotransf ection" is meant the 
simultaneous or substantially simultaneous delivery of 
more than one nucleic acid element to the same host 
cell or population of cells, regardless of the method 
5 used. Thus, the various nucleic acid elements of the 
present invention may be delivered simultaneously. by 
combining all elements in the transfectidn reaction or 
by sequentially delivering the various elements, in 
any order, to the host cell or population of cells. 

10 "Homology 41 refers to the percent identity 

between two polynucleotide or two polypeptide 
moieties. Two DNA, or two polypeptide sequences are 
"substantially homologous" to each other when the 
sequences exhibit at least about 75%-85%, preferably 

15 at least about, 90%, and most preferably at least about 
95%- 98% sequence identity over a defined length of the 
molecules. As used herein, substantially homologous 
also refers to sequences showing complete identity to 
the specified DNA or polypeptide sequence. 

20 Percent "identity" between two amino acid or 

polynucleotide sequences can be determined by a direct 
comparison of the sequence information between two 
molecules by aligning the sequences, counting the 
exact number of matches between the two aligned 

25 sequences, dividing by the length of the shorter 

sequence, and multiplying the result by 100. Readily 
available computer programs can be used to aid in the 
analysis, such as ALIGN, Dayhoff, M.O. in Atlas of 
Protein Sequence and Structure M.O. Dayhoff ed. , 5 

30 Suppl . 3.:353-358, National biomedical Research 

Foundation, Washington, DC, which adapts the local 
homology algorithm of Smith and Waterman (1981) . 
Advances in Appl . Math. 2:482-489 for peptide 
-analysis. Programs for determining nucleotide 

35 sequence identity . are available in the Wisconsin 

Sequence Analysis Package, Version 8 (available from 
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Genetics Computer Group, Madison, WI) for example, the 
BESTFIT, FASTA and GAP programs, which also rely on 
the Smith and Waterman algorithm. These programs are 
readily utilized with the default parameters 
5 recommended by the manufacturer and described in the 
Wisconsin Sequence Analysis Package referred to above. 
For example, percent identity of a particular 
nucleotide sequence to a reference sequence can be 
determined using the homology algorithm of , Smith and 

10 Waterman with a default scoring table and a gap 
penalty of six nucleotide positions . 

Alternatively, identity can be determined by 
hybridization of polynucleotides under conditions 
which form stable duplexes between homologous regions, 

15 followed by digestion with single-stranded-specific 
nuclease (s), and size determination of the digested 
fragments. DNA sequences that are substantially 
homologous can be identified in a Southern 
hybridization experiment under, for example, stringent 

20 conditions, as defined for that particular system. 
Defining appropriate hybridization conditions is 
within the skill of the art. See, e.g., Sambrook et 
al., supra- t DNA Cloning, supra; Nucleic Acid 
Hybridization, supra. 

25 

II. Modes of Carrying Out the Invention 
The present invention is based on the 
development of a novel noncloning method for 
recombinantly producing a desired protein. In 

30 particular, the invention utilizes three basic 

elements: (1) a promoter element; (2) at least one 
gene of interest; and (3) a selectable marker cassette 
which includes in 5' to 3' order, an internal ribosome 
entry site ("IRES"), at least one gene coding for a 

35 selectable marker, a transcription termination 

sequence and, optionally, a polyadenylation signal. 
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The three basic elements are cotransf ected into a 
mammalian host cell where they become operably linked 
such that expression of the gene of interest and the 
selectable marker gene occur. 
5 In order to facilitate appropriately ordered 

ligation of the various fragments in the host cell, . 
the DNA fragment bearing the gene of interest may be 
treated with appropriate enzymes such that cohesive 
termini complimentary to the 3' and 5' termini of the 

10 promoter and selection cassette fragments , 

respectively are produced. . In this way, ordered 
association of the fragments may occur in viva. 

The noncloning technique of the present 
invention eliminates the need to subclone the gene of 

15 interest and can be used in a wide variety of 

contexts, such as to express PGR products encoding 
both known and unknown proteins, as well as for the 
expression of synthetic nucleic acid sequences, cDNA 
libraries, genomic DNA derived directly from a virus 

20 or organism of interest, and the like. 

Furthermore, particular embodiments of the 
present invention take advantage of selectable marker 
nucleic acid elements designed to select cells which 
are high protein expressors. For example, attenuated 

25 drug resistance genes can be used in the selectable 
marker /elements which require greater amounts of the 
selectable marker to be expressed in order to overcome 
the weaker resistance conferred by the gene. Hence, 
cells selected based on expression of the marker will 

30 inherently contain higher levels of the recombinant 
product. 

Additionally, the selectable marker elements 
•are designed with an IRES sequence such that the 
'"• downstream selectable marker genes are expressed less 
3 5 vigorously than the gene of interest which is under' 
the control of the promoter element. Thus, cells 
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which are selected based on expression of the • 
selectable marker gene will inherently be high 
expressors of the polypeptide of interest. Finally, 
more than one marker gene and/or amplifiers may be 
5 - used in the selectable marker construct to assure 
enhanced expression of the selectable marker gene. 

Thus, the methods of the present invention 
will find use for the expression of a wide variety of 
substances, including peptides which act as 

10 antibiotics and antiviral agents, e.g., immunogenic 
peptides for use in vaccines and diagnostics; 
antineoplastics; immunomodulators , such as any of the 
various cytokines including interleukin- 1 , 
interleukin-2 , interleukin- 3 , interleukin-4 , and 

15 gamma- interferon; peptide hormones such as insulin, 
proinsulin, growth hormone, GHRH, LHRH, EGF, 
somatostatin, SNX-111, BNP, insulinotropin, ANP , FSH, 
LH, PSH and :hCG, gonadal steroid hormones (androgens, 
estrogens and progesterone) , thyroid- stimulating 

2 0 hormone, inhibin, cholecystokinin, ACTH, CRF, 

dynorphins, endorphins, endothelin, fibronectin 
fragments, galanin, gastrin, insulinotropin, glucagon, 
GTP-binding protein fragments, guanylin, the 
leukokinins, magainin, mastoparans, dermaseptin, 

25 systemin, neuromedins, neurotensin, pancreastat in, 
pancreatic polypeptide, substance P, secretin, 
thymosin, and the like; and growth factors, such as 
PDGF, EGF, KGF, IGF-1 and IGF-2, FGF, and the like. 

More particularly, proteins for use in 

30 vaccines and diagnostics may be of viral, bacterial, 
fungal or parasitic origin, including but not limited 
to, those encoded by human and animal viruses and can 
correspond to either structural or non-structural 
. proteins. For example, the present system will find 

35 ' use for recombinantly . producing a wide variety of 
proteins from the herpesvirus family, including 
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proteins derived from herpes simplex virus (HSV) types 
1 and 2, such as HSV-1 and HSV-2 glycoproteins gB, gD 
. and gH; proteins derived from varicella zoster virus 
(VZV) , Epstein-Barr virus (EBV) and cytomegalovirus' 
5 (GMV) including CMV gB and gH; and proteins derived 
from other human herpesviruses such as HHV6 and HHV7 . 
(See, e.g. Chee et al . , Cytomegaloviruses (J.K. 
McDougall, ed. , Springer -Verlag 1990) pp. 125-169, for 
a review of the protein coding content of 

10 cytomegalovirus; McGeoch et al . , J. Gen. Virol. (1988) 
£9:1531-1574, for a discussion of the various HSV-1 
encoded proteins; U.S. Patent No. 5,171,568 for a 
discussion of HSV-1 and HSV- 2 gB and gD proteins and 
the genes encoding therefor; Baer et al . , Nature 

15 (1984) 310 :207-211, for the identification of protein 
coding sequences in an EBV genome; and Davison and 
Scott,. J". Gen. Virol. (1986) £7: 1759-1816, for a 
review of VZV.) 

Polynucleotide sequences encoding proteins 

20 from the hepatitis family of viruses , including 

hepatitis A virus (HAV) , hepatitis. B virus (HBV) , 
hepatitis C virus (HCV) , the delta hepatitis virus 
(HDV) , hepatitis E virus (HEV) and hepatitis G virus 
(HGV) , can also be conveniently used in the techniques 

25 described herein. . By way of example, the viral 

genomic sequence of HCV is known, as are methods for 
. obtaining the sequence. See, e.g., International 
Publication Nos . WO 89/04669; WO 90/11089; and WO 
90/14436. The HCV genome encodes several viral 

30 proteins, including El (also known as E) and E2 (also 
known as E2/NSI) . (See, Houghton et al . , Hepsitology 
(1991) 14.:381-388, for a discussion of HCV proteins, 
including El and E2 . ) The sequences encoding each of 
these proteins, as well as antigenic fragments - 

35 thereof, will find use in the present methods. 

Similarly, the coding sequence for the 6-antigen from 
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HDV is known (see, e.g., U.S. Patent No. 5,378,814) 
and this sequence can also be conveniently used in the 
present methods .. Additionally, antigens derived from 
HBV, such as the core antigen, the surface antigen, 
5 sAg, as well as the presurface sequences, preSl and 

preS2 (formerly called preS) , as well as combinations 
of the above, such as sAg/preSl, sAg/preS2, 
sAg/preSl/preS2 , and preSl/preS2, will find use 
herein. See, e.g., "HBV Vaccines - from the 

10 - laboratory to license: a case, study" in Mackett, M . 

and Williamson, J.D., Human Vaccines and Vaccination, 
pp., 159-176, for a discussion of HBV structure; and 
U.S. Patent Nos . 4,722,840, 5,098,704., 5,324,513; 
Beames et al . , J. Virol. (1995) .69:6833-6838, Bimbaum 

15 et al., J. Virol. (1990) 64 : 3319-3330 ; and Zhou et 
al . , J". Virol. (1991) 6j5: 5457-5464 . 

Polynucleotide sequences encoding proteins 
derived from other viruses will also find use in the 
claimed methods, such as without limitation, proteins 

2 0 from members of the families Picornaviridae (e.g., 

polioviruses , etc.); Caliciviridae ; Togaviridae (e.g., 
rubella virus, dengue virus, etc.); Flaviviridae ; 
Coronaviridae; Reoviridae; Birnaviridae ; 
Rhabodoviridae (e.g., rabies virus, etc.) ; 

25 Filoviridae; Paramyxoviridae (e.g., mumps virus, 

measles virus, respiratory syncytial virus, etc.) ; 
Orthomyxoviridae (e.g., influenza virus types A., B and 
C, etc.) ; Bunyaviridae ; Arenaviridae ; Retroviradae 
(e.g., HTLV-I; HTLV-II; HIV- 1 (also known as HTLV-III, 

30 LAV, ARV, hTLR, etc.)), including but not limited to 

antigens from the isolates HIV IIIb , HIV SF2/ HIV^, HIV^, 
HIV^) ; HIV- 

I-CM235/ HIV-1 US4 ; HIV- 2; simian 
immunodeficiency virus (SIV) among others. See, e.g. 
Virology, 3rd Edition (W.K. Joklik ed. 1988); 
35 Fundamental Virology, 2nd. Edition (B.N. Fields and 
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£>.M. Knipe, eds . 1991) , for a description of these and 
other viruses . 

For example, the invention may be used to 
express genes encoding the gpl2 0 envelope protein from 
5 any of the above HIV isolates. The gpl20 sequences . 
for a multitude of HIV-1 and HIV- 2 isolates, including 
members of the various genetic subtypes of HIV, are 
known and reported (see, e.g., Myers et al . , Los 
Alamos Database, Los Alamos National Laboratory, Los 

10 Alamos, New Mexico (1992) ; Myers et al . , Human 

Retroviruses and Aids., 1990, Los Alamos, New . Mexico: 
Los Alamos National Laboratory; and Modrow et al . , J. 
Virol. (1987) 6JL: 570-578, for a comparison of the 
envelope gene sequences of a variety of HIV isolates) 

15 r and sequences derived from any of these isolates will 
find use in the present methods. Furthermore, the 
invention is equally applicable to other immunogenic 
proteins derived from any of the various HIV isolates, 
including any of the various envelope proteins such as 

20 gpl60 and gp41, gag antigens such as p24gag and 
p55gag, as well as proteins derived from the pol 
region . 

The present invention will also find use for 
the expression of influenza virus proteins. 

25. Specifically, the envelope glycoproteins HA and NA of 
influenza A are of particular interest for generating 
an immune response. Numerous HA subtypes of influenza 
A have been identified (Kawaoka et .al.., Virology 
(1990) 179 : 759-767 ; Webster et al . , "Antigenic 

30 variation among type A influenza viruses, 11 p. 127-168. 
In: P. Palese and D.W. Kingsbury (ed.), Genetics of 
influenza viruses Springer- Verlag, New York) . Thus, 
the gene sequences encoding proteins derived from any 
of these isolates can also be used in the recombinant 

3 5 production techniques described herein. 
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Furthermore, the methods described herein 
provide a means for producing proteins useful for 
treating a variety of malignant cancers. For example,, 
the system of the present invention can be used to 
5, produce a variety of tumor antigens which in turn may 
be used to mount both humoral and cell -mediated immune 
responses to particular proteins specific to the 
cancer in question, such as an activated oncogene , a. 
fetal antigen, or an activation marker. Such tumor 

10. antigens include any of the various MAGEs (melanoma 

associated antigen E) , including MAGE 1, 2, 3, 4, etc. 
(Boon, T. Scientific American (March 1993) : 82-89) ; any 
of the various tyrosinases; MART 1 (melanoma antigen 
recognized by T cells), mutant ras ; mutant p53; p97 

15 ' melanoma antigen; CEA ( care inoembryonic antigen) , 
among others . 

It is readily apparent . that the subject 
invention can be used to produce a variety of proteins 
useful for the prevention, treatment and/or diagnosis 

2Ch of a wide variety of diseases. 

Polynucleotide sequences coding for the 
above-described molecules can be obtained using 
recombinant methods, such as by screening cDNA and 
genomic libraries from cells expressing the gene, or 

25 by deriving the gene from a vector known to include 
the samel Furthermore, the desired gene can be 
isolated directly from cells and tissues containing 
the. same, using standard techniques, such as phenol, 
extraction and PCR of cDNA or genomic DNA. See, e.g., 

30 Sambrook et al . , supra,, for a description of, 

techniques used to obtain and isolate DNA. The gene 
of interest can also be produced synthetically, rather 
than cloned. The nucleotide sequence can be designed 
with the appropriate codons for the particular amino. 

3 5 acid sequence desired. In general, one will select 
preferred codons for the intended host in which the 
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sequence will be expressed. The complete sequence is 
assembled from overlapping oligonucleotides prepared 
by standard methods and assembled into a complete 
coding sequence. See, e . g. , Edge , Nature (19.81) 
5 292 : 756 ; Nambair et al . , Science (1984) 223 : 12 99 ; Jay 
et.al., J. Biol. Chem. (1984) 259 :6311. 

. The promoter for use in the present 
invention is. one which will direct transcription of 
the gene of interest, as welj as the selectable 
10 marker, in a mammalian host cell when operably linked 
thereto \ Typical promoters for mammalian cell 
expression include the SV40 early promoter, a CMV 
promoter such as the CMV immediate early promoter, the 
mouse mammary tumor virus LTR promoter, the adenovirus 
15 major late promoter (Ad MLP) , and the herpes simplex 
virus promoter, among others. Other nonviral 
promoters, such as a promoter derived from the murine 
metallothionein gene, will also find use for mammalian 
expression. These and other promoters can be obtained 
20, from commercially available plasmids, using techniques 
well known in the art. See, e.g., Sambrook et al . , 
supra. Enhancer elements may used in association with 
the promoter to increase expression levels of the 
mammalian constructs. Examples include the SV40 early 
25 gene enhancer, as described in Dijkema et al . , EMBO J". 
(1985) 4.: 761, the enhancer/promoter derived from the 
long terminal repeat (LTR) of the Rous Sarcoma Virus, 
as described in Gorman et al . , Proc . Natl. Acad. Sci . 
USA (1982b) T9\6111 and elements derived from human 
30 CMV, as described in Boshart et al . , Cell (1985) 

41 : 521 . such as elements included in the CMV intron A 
sequence 

A particularly preferred promoter for use 

with the present invention is the human • 

3 5 cytomegalovirus immediate -early enhancer /promoter , 
known as HCMV IE1, which includes the CMV intron A. 
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See, e.g., Chapman et al . , Nuc . Acids Res. (1991) 
19:3979-3986; Boshart et al.. Cell (1985) 41:521. 
This promoter is found in, e.g., plasmid pCMV6 . See, 
e.g., Chapman et al . , supra; and U.S. Patent No. 
5 5,688,688. 

As explained above/ the system of the 
invention also includes an element encoding one or 
more selectable markers. A number of suitable 
selectable markers can be used in the practice of the 

10 invention, such as a marker which imparts resistance 
to a' cytotoxic agent to the transformed mammalian 
cell. The cytotoxic agent can be, but is not limited 
to, neomycin and neomycin analogues such as genet icin, 
hygromycin, and the like. For example, the gene 

15 encoding aminoglycoside -phosphotranf erase (APH) allows 
selection in mammalian cells by conferring resistance 
to neomycin. Several mutations to this gene have been 
described which impart resistance to the neomycin 
analogue geneticin (G418) (available from Sigma, St. 

2 0 Louis, MO.) . For example, an APH with aspartic acid _ 
at position 261 replaced by an asparagine, confers 
reduced resistance to gentamicin. See, e.g., Blazquez 
et al., Molec. Micro. (1991) 5:1511-1518. Thi s , or 
other selectable markers that confer attenuated 

25 resistance to toxic agents as compared to their wild- 
type counterparts are particularly preferred as such 
genes require larger amounts of. protein to be 
expressed in order to overcome the weaker resistance 
conferred by the attenuated gene.. 

30 Other markers useful herein include cell 

surface markers such as alkaline phosphatase, nerve 
growth factor receptor, or any other suitable 
membrane-associated moiety. Representative examples 
of such markers and associated prodrug molecules 

35 include alkaline phosphatase and various toxic 
phosphorylated compounds such as phenolmustard 

-21- 



WO 99/53046 PCT/US99/08069 

phosphate , doxorubicin phosphate , . mitomycin phosphate 
and etoposide phosphate; /8-galactosidase and 
■ N- [4- (£-D-galactopyranosyl) 
benyloxycarbonyl] -daunorubicin; azoreductase and 
5 azobenzene mustards; /3-glucosidase and amygdalin; 
/3-glucuronidase and phenolmustard-glucuronide and 
epirubicin-glucuronide ; carboxypeptidase A and 
methotrexate - alanine ; cytochrome P450 and 
cyclophosphamide or if osf amide; DT diaphorase and 

10 5- (aziridine-l-yl) -2 , 4 , dinitrobenzamide (CB1954) (Cobb 
et al, (1969) Biochem. Pharmacol .18.: 1519 , Knox et al . 
(1993) Cancer Metastasis -Rev. 12,:195); /3-glutamyl 
transferase and ^-glutamyl p-phenylenediamine mustard; 
nitroreductase and CB1954 or derivatives of 

15 4-nitrobenzyloxycarbonyl ; glucose oxidase and glucose ; 
xanthine oxidase and hypoxanthine ; and plasmin and 
peptidyl-p- phenyl enediamine -mustard. Nonimmunogenic 
markers may also be made by expressing an enzyme in a 
compartment of the cell where it is not normally 

20 expressed. 

Still other suitable markers are genes which 
impart color to those cells transfected with a nucleic 
acid element containing the selectable marker such 
that detection can be achieved by virtue of a color 

25 change (either visible or fluorescent) . For example, 
the gene encoding Green Fluorescent Protein (GFP) may 
be used as the selectable marker, as can derivatives 
thereof such as Enhanced Green Fluorescent Protein 
(EGFP) , and like molecules. 

3 0 These and other selectable markers can be 

obtained from commercially available plasmids, using 
techniques well known in the art. See, e.g., Sambrook 
et al . , supra. 

Expression can also be amplified by placing - 

35 an amplifiable gene, such as the mouse dihydrof olate , 
reductase {dhfr) gene adjacent to the coding sequence. 
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Cells' can then be selected for methotrexate resistance 
in dhf r-def icient cells. See, e.g. Urlaub et al . 
(1980) Proc. Natl. Acad. Sci . USA 77:4216-4220; 
Ringold et al . (1981) J. Mol . and Appl . Genet. 1:165- 
5 175. 

Transcription termination and 
polyadenylation sequences may also be present on the 
selectable marker element, located 3' to the 
translation stop codon for the selectable marker. 

10 Examples of transcription terminator/polyadenylation 

signals include, but are not limited to, those derived 
from SV40, as described in Sambrook et al.,, supra, as 
well as a bovine growth hormone terminator sequence. 
Additionally, 5'- UTR sequences, can be placed adjacent 

15 to the coding sequence of the selectable marker in 

order to enhance expression of the same. Particularly 
useful with the present system are UTRs which include 
an Internal Ribosome Entry Site (IRES) present in the 
leader sequences of picornaviruses such as the 

20 encephalomyocarditis virus (EMCV) UTR (Jang et al . J. 
Virol. (1989) 63 : 1651-1660 . Other picornavirus UTR 
sequences that will also find use in the present 
invention include the polio leader sequence and. 
hepatitis A virus leader and the hepatitis C . IRES . 
.25 A particular advantage of structuring the 

system as described above is that those sequences 
downstream of the IRES elements, i.e., the selectable 
marker genes, may be expressed less vigorously than 
the gene of interest which is under the control of the 

3 0 promoter element. Thus, cells which are selected 

based on expression of the selectable marker gene will 
inherently be high expressors of the polypeptide of 
interest. 

If more than one selectable marker is 
3 5 present in the element, a spacer region can be 

incorporated between the two genes in place of the 
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stop cbdon, in order to assure continuous read -through 
of the marker genes. Such a spacer will normally 
include about 3 to about 150 nucleotides, more 
preferably about 6 to about 6 0 nucleotides, even more 
5 preferably about 6 to about 3 0 nucleotides, and most 

preferably about 12 to about 18 nucleotides. The only 
requirements for the spacer region is that continuous 
read- through is maintained and that the spacer does 
not encode a polypeptide sequence that will interfere 
10 with the function of the markers or the gene of 
interest. 

A particularly preferred selectable marker, 
element for use with the present invention includes a 
wildtype dhfr gene fused to an attenuated form of the 

15 gene conferring reduced neomycin resistance (termed 
neo r ) . See, e.g. , Blazquez et al . , Molec. Micro. 
(1991) 5:1511-1518 for a description of the mutated 
neo r gene. The attenuated gene encodes. an 
aminoglycoside-phosphotransferase with the aspartic 

20 acid at position 261 replaced by an asparagine . This 
element provides selection based on the ability of the 
transfected cell to grow, on genet icin- supplemented 
media, as well as amplification using the dhfr gene 
and selection based on methotrexate resistance. The 

2 5 element includes a spacer of 12 nucleotides that 

encodes the amino acid sequence Gly-Gly-Gly-Ser 
located between the dhfr and attenuated neo r genes, in 
place of the translation stop codon for the dhfr gene. 
The element also includes the EMCV IRES sequence 

3 0 located upstream of the dhfr and neo genes. Finally, 

a bovine growth hormone transcription termination 
sequence is located 3' to the translation stop codon 
for the neo sequence. This element is shown in 
Figures 1 and 2. - - . 

35 Prior to transf ection, appropriate enzymes 

are used to create overlapping untranslated sequences 
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between the gene of interest, promoter and selectable 
marker elements, to assure compliment arity between the 
three elements so that an ordered ligation of these 
elements occurs in the host cell. Such enzymes 
5 include restriction enzymes , exonucl eases and 

polymerases, such as T4 polymerase, all of which are 
well known in the art. For example, restriction 
endonucleases with various specificities have been 
isolated from a wide range of prokaryotes and are well 

10 known in the art. See, e.g., Sambrook et al . , supra. 
The choice of an appropriate restriction endonuclease 
depends on the particular sequence targeted. One of. 
skill in the art will readily recognize the proper 
restriction enzyme to use for a desired sequence. 

15 Generally, the overlapping sequences will include 
about . 3 to about 150 nucleotides, more preferably 
about 4 to about 50 nucleotides, and most preferably 
about 4 to about 10 nucleotides. Alternatively, 
blunt -end ligations can occur in vivo such that 

20 overlapping sequences are not necessary.. 

Furthermore, each of the fragments may 
include untranslated spacer residues, at the 5' and 3' 
ends, to protect the fragment from exonucleolytic 
activity in transfected cells. Such extensions will 
•25 generally be about 3 to as many as 1000 nucleotides in 
length, more typically 3-50 nucleotides in length, and 
preferably 5-2 5 nucleotides in length. 

Once the various elements are constructed, 
they. are used to transfect eucaryotic cells, such as 

3 0 mammalian, insect or yeast cells. Mammalian cells for 
use as hosts with the present invention include many 
primary and immortalized ,cell lines, available from 
the American Type Culture Collection (ATCC) , including 
but not limited to, Chinese hamster ovary (CHO) cells, 

3 5 HeLa cells, baby hamster kidney (BHK) cells, monkey 
kidney cells (COS) , human hepatocellular carcinoma 
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cells (e.g., Hep G2) , and a number of other cell 
lines. Yeast hosts include Saccharomyces cerevisiae 
Saccharomyces carlsbergeneis, Candida albicans, 
Candida maltqsa, Hansenula polymorpha, Kluyveromyces 
5 fragilis , Kluyveromyces lactis,, Pichia 
guillerimondii , Pichia pas tori s and 

Schizo saccharomyces pombe. Insect systems include for 
example recombinant baculovirus expression vectors for 
infection of inter alia, Aedes aegypti , Autographa 

10 calif ornica, Bombyx mori, Drosophila melanogaster , 
Spodoptera frugiperda, and Trichoplusia ni. . 

A wide variety of methods can be used to . 
deliver the gene of interest, promoter and selectable 
marker elements to mammalian cells. Such methods 

15 include DEAE dextran-mediated transf ection, calcium 

phosphate precipitation, polylysine- or polyornithine- 
mediated transf ection, or precipitation using other 
insoluble inorganic salts, such as strontium 
phosphate, aluminum silicates including bentonite and 

20 kaolin, chromic oxide, magnesium silicate, talc, and 
the like. Other useful methods of transf ection 
include electroporation, sonoporation, protoplast 
fusion, liposomes, peptoid delivery, or 
microinjection. See, e.g., Sambrook et al . , supra, 

25 for a discussion of techniques for transfecting 
mammalian cells. 

For example, the elements can be packaged in 
liposomes prior to delivery to the cells. Lipid 
encapsulation is generally accomplished using 

3 0 liposomes which are able to stably bind or entrap and 
retain nucleic acid. The ratio of condensed DNA to 
lipid preparation can vary but will generally be 
around 1:1 (mg DNA:micromoles lipid), or more of 
lipid. — For a review of the use of liposomes as 

3 5 carriers for delivery of nucleic acids, see, Hug and 
Sleight, Biochim. Biophys. Acta. (1991) 1097 : 1-17 ; 
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Straubinger et.al., in Methods of Enzymology (1983), 
Vol. 101, pp. 512-527. 

Liposomal preparations for use with the 
present invention include cationic (positively 
5 charged),, anionic (negatively charged) and neutral 
preparations, with cationic liposomes particularly 
preferred. Cationic liposomes are readily available. 
For example, N [1-2 , 3 -dioleyloxy); propyl] -N,N, N- 
triethylammonium (DOTMA) liposomes are available under 

10 the trademark Lipof ectin, from GIBCO BRL, Grand 

Island, NY. (See, also, Feigner et al . , Proc. Natl. 
Acad. Sci. USA (1987) 84 : 7413-7416) . Other 
commercially available lipids include transfectace 
(DDAB/DOPE) and DOTAP/DOPE (Boerhinger) . Other 

15 cationic liposomes can be prepared from readily 

available materials using techniques well known in the 
art. See, e.g., Szoka et al . , Proc. Natl. Acad. Sci. 
USA (1978) 75:4194-4198; PCT Publication No .. WO 
90/11092 for a description of the synthesis of DOTAP 

20 (1, 2 -bis (oleoyloxy) -3- (trimethylammonio) propane) 
liposomes. The various liposome-nucleic acid 
complexes are prepared. using methods known in the art. 
See, e.g., Straubinger et al . , in METHODS OF 
IMMUNOLOGY (1983), Vol. 101, pp. 512-527; Szoka et 

25 al., Proc. Natl. Acad. Sci. USA (1978) 7.5:4194-4198; 

Papahadj opoulos et al . , Biochim. Biophys . Acta. (1975) 
194:483; Wilson et al . , Cell (1979) 17:77) ; Deamer and 
Bangham, Biochim. Biophys. Acta. (1976) 443:62 9; Ostro 
et al . , Biochem. Biophys . Res . Commun. (1977) 7 6:836; 

30 Fraley et al . , Proc . Natl . Acad. Sci. USA (1979) 

26:3348); Enoch and Strittmatter , Proc. Natl. Acad. 
Sci. USA (1979) 76:145) ; Fraley et al . , . J\ Biol. Chem. 
(1980) 255:10431; Szoka and Papahadj opoulos , Proc. 
Natl. Acad. Sci. USA (1978) 75:145; and Schaefer- 

35 Ridder et al . , Science (1982) 215 : 166 . 
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The DNA can also be delivered in cochleate 
lipid compositions similar to those described by 
Papahadjopoulos et. al . , JBiochem. Biophys . Acta. (1975) 
394:483-491. See, also, U.S. Patent Nos. 4,663,161 

* 5 and 4, 871,488 . 

After transf ection, cells which express the 
gene of interest are selected away from those that do 
not, based on expression of the selectable marker. 
For example, if the selectable marker element used 

10 confers resistance to a cytotoxic agent, the cells can 
be contacted with the appropriate cytotoxic agent, 
whereby nontransf ected cells, as well as nonexpressing 
or very low expressing cells, are negatively selected 
away from the transf ected cells . If the selectable 

15 marker is a cell surface marker, the cells can be 
contacted with a binding agent specific for the 
particular cell surface marker, whereby the 
transf ected cells can be positively selected away from 
the population. 

20 The selection step can also entail 

fluorescence-activated cell sorting (FACS) techniques, 
such as where FACS is used to select cells from the 
population containing a particular surface marker. 
These cell sorting procedures are described in detail, 

2 5 for example, in the FACSVantage™ Training Manual, with 
particular reference to sections 3-11 to 3-28 and 10-1 
to 10-17. The selection step may also use 
magnetically responsive particles as retrievable 
supports for target cell capture and/or background 

.30; removal. These and similar separation procedures . are 
described, for example, in the Baxter Immunotherapy 
Isolex training manual. 

Various modifications, of the above -described 
:.. ..-system can. be .made in order. to__ enhance .ligation. of_ the 

35 various fragments in the host cell. For example, cell 
lines expressing a ligase in the cytoplasm, such as 
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bacteriophage T4 DNA ligase, can be used in the 

methods of the invention. This ligase acts to join 

complementary, cohesive DNA termini and may be used to 

aid in linking the various elements in an orderly 

5 fashion. E. coli DNA ligase can also be used with the 

method of the invention. 

Additionally, techniques can be used to aid 

in selection of clones containing the properly ligated 

construct. For example, a toxin cassette, driven by a 

10 second IRES sequence, can be incorporated into the 
selectable marker element, upstreanrof the IRES 
sequence which is associated with the selectable 
marker gene. A splice acceptor sequence is then 
located at the junction of the first transcription 

15 terminator and the IRES sequence adjacent the 

selectable marker gene. The gene of interest is then 
designed to include a splice donor at its 3' end. 
Accordingly, correct ligation between the gene of 
interest and the selectable marker element ensures 

2 0 removal of the toxin gene. Independent integration of 
the selectable marker element bearing the toxin gene ... 
-will lead to cell death and hence, no clones. 

Alternatively, GFP or its derivatives, such 
as but not limited to EGFP, may replace the drug 

25 resistance gene in the selectable marker cassette, 
permitting FACS cell sorting of highly expressing 
clones, as described above. 

Once expressed, the product may be isolated 
and purified by any number of techniques, well known 

30- in the art, including : chromatography , e.g., HPLC, 

affinity chromatography, ion exchange chromatography, 
size-exclusion, etc.; electrophoresis; density 
gradient centrif ugation; solvent extraction, or the 
like. See, e.g., Protein Purification Principles and 
.35 Practice, 2nd edition (Robert K. Scopes ed. 1987); and 
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Protein Purification Methods/ a Practical Approach 
(E. L.W Harris and S. Angal , eds . 1990) . 

III. Experimental 
5 Below are examples of specific embodiments 

for carrying out the present invention. The examples 
are offered for illustrative purposes only, and are 
not intended to limit the scope of the present 
invention in any way. 
10 Efforts have been made to ensure accuracy , 

with respect to numbers used (e.g. , amounts, 
temperatures, etc.), but some experimental error and 
deviation should, of course, be allowed for. 

15 Example 1 

Recombinant Production of HBV preS2 

Materials and Methods 

Cell Culture : dhfr- negative CHOdg44 (Urlaub 
20 et al. , (1980) Proc. Natl. Acad. Sci . USA 77:4216- 
4220) cells were grown in Ham's nutrient F12 medium 
(JRH, Inc. Lenexa, KA) supplemented- with , penicill in 
(2nM) , streptomycin (100 fig/ml ) , glut amine (2nM) , 
proline (2 /zg/ml), , and non-essential amino acids 

2 5 (Gibco) . DNA fragments were introduced into the cells 

by LT1 (Pan Vera Corp. Madison, WI) transf ection . 
Briefly,. 2 /xg of each DNA fragment to be transfected 
were mixed (unligated) prior to addition of 30 /ig of 
the LT1 reagent. This mixture was applied to twice 
30 washed monolayers in serum-free OptiMEM (Gibco) and 
cells incubated 4h, 37°C. Monolayers were washed 
once, overlayed with growth medium for 4 8h, then 
subcultured 1:10 into, selective medium. The dhfr 
- selective medium consisted of Ham' s -F12 -medium lacking 

3 5 hypoxanthine , thymidine, and glycine that was 

supplemented as for the nutrient F12 medium except for 

-30- 
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the use of 10% dialyzed fetal calf serum; for dual 
dhfr/ neo r selection, dhfr selective medium was 
supplemented with 250 j-tg/ml geneticin (Gibco) , After 
approximately 15-21 days individual colonies were 
5 isolated to individual wells of a 96 well plate and 
expanded in selective media lacking geneticin. When 
clones reached confluence in the 96 well plate, they 
were transferred to an individual well of a 24 well 
plate and further expanded. At confluence or at near 
„ 10 confluence in the 24 well plate, individual clones 

were tested for HBV surface antigen (sAg) expression. 
Positive clones were transferred to T 25 (or 25 cm 2 ) 
culture flasks and were carried for 6-8 passages to 
assess stability of expression by assaying medium from 

15 near confluent monolayer cultures that had been 
incubated 48h in fresh culture medium. Stably 
expressing clones of comparable sAg titer were pooled 
and subjected to methotrexate (Mtx) amplification in 
dhfr selective medium at increasing concentrations of 

20 drug (10, 20, 50 nM) . Individual clones from first 

round amplification were analyzed as described above; 
clones of comparable sAg titer were pooled and 
subjected to a second round of Mtx amplification at 
increasing concentrations of drug (50, 100, 500, or 

2 5 100 0 nM) . Individual clones from this round of ; 

amplification were similarly analyzed as described 
above . 

Plasmids and DNA manipulations : 

i 

3 0 ' Plasmid pCMVII : The CMV enhancer/promoter 

plus Intron A was transferred from pCMV6 (Chapman et 
al., Nuc. Acids Res. (1991) 19:3979-3986) as a 
Hindlll-Sall fragment into pUC19 (New England Biolabs, 
Inc., Beverly, MA) together with the bovine growth 
35 hormone terminator from pCDNA3 (Invitrogen, Inc., 
Carlsbad, CA) to give pCMVII. 
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Plasmid pET-EMCV-DHFR/Neo : The EMCV leader 
was PGR- amplified from pCite-4a+ (Novagen, Inc. , 
Milwaukee, WI) and inserted into pET-23d (Novagen, 
Inc., Milwaukee, WI) as an Xba-Nco fragment to give 
5 pET-EMCV. The dhfr gene was PCR-amplif ied using 

primers 38 and 75, described below, to give a product 
with a Gly<-Gly-Gly-Ser spacer in place of the 
translation stop codon and inserted as an Nco- BairiH.1 
fragment to give pET-E-DHFR. Next, the attenuated neo 

10 gene was PCR amplified from pSV2Neo (Clontech, Palo 
Alto, CA) using primers 91 and 93, described below, 
and inserted into the unique BamHl site of pET-E-DHFR 
to give pET-E-DHFR/Neo (m2) . Finally the bovine growth 
hormone terminator was transferred from pCMVII and 

15 inserted downstream of the neo gene to give pET-E- 
DHFR / Neo (m2 > BGHt (Figure 1). 

CMV Promoter Element : For preparation of 
the CMV promoter (CMVp) fragment for cell 

20 transf ections , pCMVII was cleaved with Hindi I I and 
Sail; the EMCV- dhfr/ neo selectable marker cassette 
fragment was prepared by cleavage of pET-E- 
DHFR/Neo (m2) BGHt with Xbal and Notl . Plasmids were 
purified using Qiagen Maxi purification kits (Qiagen 

25 Corp., Santa Clarita, CA) 

PCR Primers: The primers for use in 
obtaining the gene encoding hepatitis B virus (HBV) 
preS2, were primer 83 (5'- 

3 0 GTCGACGACAGATGCAGTGGAATTCCACT-3 ' ) (SEQ ID NO: J and 

primer 84 ( 5 ' - GCTAGCGCAGATTAAATGTATACCCAGAGACA- 3 ' ) 

(SEQ ID NO:_ ) . The mutant neo primers were 91 

(5' tcagaagaactcgttaagaa-3' ) (SEQ ID NO:_ ) and 93 

( 5 ' - TTGAACAAGATGGATT- 3 ' ) ( SEQ, ID NO : _ ) . The DHFR 

35 primers were 38 (5 ' -CCATGGTTCGACCATTGGAA- 3 7 ) ( SEQ 

I D : __ ) and 7 5 ( 5 ' - GGATCCCCGCCGTCTTTCTTCTCGTA- 3 ' ) ( SEQ 
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ID NO: J.. The junction RT-PCR primers for the CMVp- 

preS2 junction were 126 (5 ' -GGGAACGGTGCATTGGAA-3 ' ) 

(SEQ ID NO: ) and 127 (5 ' -CCCTGACTCTGGGATCC-3 ' ) ; (SEQ 

ID NO : ) or 95 ( 5 ' -CCACGGTGTTTTGACCTC-3 ' ) (SEQ ID 

5 NO:_ ) and 96 (5 ' -CCGTACTGGTTGTTGTTGA-3 ' ) (SEQ ID 

NO: ) ; primers for the preS2-EMCV leader junction 

were 128 (5 ' -CGTGAGTCCCTTTATACCG-3 ' ) (SEQ ID NO: J 

.and 129 (5 ' -CGGCGAGTAACGTTAGGG-3 ' ) (SEQ ID NO:^ ) or 

9 8 (5 ' -TATAGACAAACGCACACCG-3 ' ) (SEQ ID NO: ) and 123 

10 ( 5 ' - AAC C AGTAC GGGAC CATG - 3 ' ) ( SEQ ID NO : J . 

PCR amplification of preS2 : The HBV gene 
encoding preS2 was' prepared by a modified hot-start ' 
PCR amplification prior to each transfection using the 

15 thermostable proofreading polymerase pfu (Stratagene, 
La Jolla, CA) . The source of the template DNA was 
extracted human donor plasma containing subtype adw 
HBV (Boston Biomedica, Inc., West Bridgewater, MA) 
Cycling conditions were 95°C for 3 min, , addition of 

20 enzyme, followed by 5 cycles of 95°C, for 1 min. 50°C 
for 1 min. , 72°C for 2 min., followed by 30 cycles of 
95°C for 1 min., 76°C for 1 min., 72°C for 2 min., 
followed by 1 cycle of 72 °C for 10 min. PCR products 
were digested with Sail and iVhel to give compatible 

25 termini with those of the CMV promoter and selectable 
marker fragments. 

Immunodetection: Surface antigen titers 
were determined using the Auzyme EIA (Abbott 

30 Laboratories, 1 Abbott Park, IL) on supernatants from 
individual clones according to supplier's 
instructions.. For radiolabelling, 8xl0 5 cells were 
seeded into 60 ,mm plates 16h prior to labelling. 
Cells were starved for lh in. serum- free DMEM lacking 

3 5 methionine and cysteine then labelled 3h with 10/iCi 35 S- 
Redivue Pro-Mix (Amersham, Arlington Heights, IL) 
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followed by a 16h chase. Supernatants were collected 
and cells lysed in NP40 lysis buffer (Selby et al M 
Virology (1994) 204 : 114-122) . Cleared supernatants 
and lysates were incubated with protein A sepharose- 
5 bound rabbit anti-sAg serum (Accurate Chemicals, 
Westbury, NY) that was previously blocked lh with 
unlabelled CHO cell supernatant or lysate . 

Analysis of transcripts for ordered linkage: 

10 For each assay, total RNA from approximately 5x1 0 6 

phosphate-buffered saline-washed cells was isolated by 
lysing cells in 1ml RNA-Stat 60 (Tel-Test B, Inc., . 
Friendswood, TX) and extracting once with CHC1 3 
followed by isopropanol precipitation. Random-primed 

15 reverse transcript ion reactions were performed using 
the GeneAmp RNA PCR kit (Perkin Elmer/Roche Molecular 
Systems, Branchburg, NJ) substituting the kit RT 
enzyme with MuMLV RT (200/i/ml) (BRL) . PCR- 
amplif ication of the CMV-promoter-preS2 junction was 

20 done using primers 126 and 127; for the preS2-EMCV 

leader junction primers 128 and 129. For an internal 
control, reverse transcription .and PCR amplification 
of the actin gene using primers AW75 and AW76 was 
done. Junction products were sequenced for comparison 

25 to the input fragment sequences. 

Results 

As explained above, dhfr- negative CH0dg44 
cells were transfected with three individual DNA 

30 elements: (i) a Hindi II -Sail fragment from pCMVII (see 
Figures 1 and 2) , containing the human CMV promoter 
with a short spacer incorporated • between exon 2 of the 
CMV immediate early transcription unit and the Sail 
cloning site; (ii)- PCR- amplified DNA coding for the 

3 5 HBV PreS2 gene including 5 base pair spacers located 
between the terminal nucleotides of the gene and the 
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terminal Sail and Nhel restriction enzyme sites of the 
fragment '„ (Figure 2); and (iii) an Xbal-Notl fragment 
from pET - E - DHFR/Neo (m2) BGHt containing the EMCV- 
DHFR/Neo' (m2) selection cassette (Figures 1 and 2) . 
5 Approximately , 100 geneticin-resistant , DHFR + 

clones were picked and analyzed for' sAg expression. 
Initial titers ranged from 0-90 ng/ml (average was 
3 0ng/ml) (Figure 3) . A subset of clones showing 
relatively higher initial expression were further 

10 expanded. Two of those that continued to show high 

expression after further expansion, T21.18 and T21.50 
(the parental clones, indicated with arrows in Figure 
3) , were shown to contain preS2 genes of correct 
sequence as determined by sequencing of RT-PCR 

15 products generated from total cellular RNAs . This 
analysis showed that for both clones, nucleotide 
sequence changes relative to the predicted sequence 
were introduced in the junctional regions at either 
end of the preS2 gene . . However, none, of these changes 

20 interfered with the. splice acceptor site of exon2 that 
lies adjacent to the 3' terminus of the CMV promoter 
fragment, nor did any of the changes alter the 
sequence of the preS2 gene itself. 

These two clones were subsequently subjected 

25 to a single round of methotrexate selection at 20, 50, 
and 100 nM drug concentrations. 

As shown in Figures 4 A and 4B, an 
unprecedented range of amplifications were observed 
for both parental clones. For each, the majority of 

3 0 subclones arising from selection showed up to a 

twenty-f old. enhancement of sAg expression relative to. 
the starting parental clone titers. While, this level 
of amplification is very impressive in comparison to 
the two to ten- fold amplifications that are' more 

35. typically seen using standard selection vectors, even 
more impressive are the subset of clones that showed 
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amplification of up to sixty-fold over the parental 
clones. This was most evident for those selected in 
50 and" 100 nM methotrexate. In the case of the 
T21 . 50 -derived selectants, a. direct relationship was 
5 observed between drug concentration and the percentage 
of clones showing amplification in the range of twenty 
to forty-fold, suggesting that there is potential 
efficiency gained by selecting at higher drug 
concentrations. . . 

10 Based on the observation that continued 

passage of high expressors can result in a 
"stabilizing" of expression, often at higher than 
initial levels, a number of the highest expressing 
clones from each set . were passaged 6-8 times in T- 

15 flasks, then reassayed for sAg expression. On the 
whole, the titers rose, with one clone (T21.18.4) 
showing an extraordinary ninety- fold amplification as 
compared to the preamplif ication titer of the parental 
clone (see Figure 5) . This level of amplification 

20 represents expression in the range of 6 mg(est.)/l0 9 
c/day. 

To verify that authentic preS2 and sAg 
proteins were being made by these recombinant cell 
clones, one of the highest expressing clones 
25 . (T21. 18.25) was radiolabelled and proteins from the 
culture, medium and cell lysates were 

immunoprecipitated using sAg-specif ic polyclonal sera. 

As shown in Figure 6, glycosylated and unglycosylated 

versions of sAg and preS2 were found in the cell 
3 0 lysates and culture medium. The sizes of the 

precipitated proteins, p2 4, gp2 7, p3 6, and gp3 9 

corresponded well to the expected sizes of 

unglycosylated/glycosylated sAg and 
- - unglycosylated/glycosylated- preS2 , respectively.. 
3 5 Accordingly, novel noncloning methods for 

recombinant ly producing proteins are disclosed. 



-36- 



WO 99/53046 



PCT/US99/08069 



Although preferred embodiments of the .subject 
invention have been described in some detail, it is 
understood that obvious variations can be made without 
departing from the -spirit and the scope of the 
5 invention as defined by the appended claims. 
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Claims 

1. A method of expressing at least one 
recombinant polypeptide comprising: 

5 (a) providing as separate elements (i) a 

first nucleic acid element which comprises a promoter; 
(ii) a second nucleic acid element which comprises at 
. least one selectable marker gene conferring a 

selectable phenotype on a cell transfected therewith, 

10 an internal ribosome entry site (IRES) sequence 

positioned upstream of the selectable marker gene, and 
a transcription termination sequence positioned 
downstream of the selectable marker gene, and (iii) a 
third nucleic acid element which comprises at least 

15 one gene encoding said at least one polypeptide; 

(b) cotransf ecting a population of mammalian 
cells with each of said first, second and third 
elements; 

(c) culturing said population of cells under 
20 conditions whereby the polypeptide and the at least 

one selectable marker are expressed; 

(d) selecting cells which express the 
selectable marker; and 

(e) identifying those cells from the 
25 selected cells that express the recombinant 

polypeptide. 

2. The method of claim 1, wherein the 
promoter is the human cytomegalovirus (CMV) immediate- 

30 early enhancer/promoter and includes the CMV intron A. 

3. The method of claim 1, wherein the at 
least one selectable marker gene comprises a neo r gene. 

35 4. The method of claim 3, wherein the neo r 

gerie confers reduced resistance to geneticin on a cell 
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transf ected therewith as compared to the wild-type neo r 
gene . 

5. The method of claim 1, wherein the at 
5 least one selectable marker gene comprises a dhfr 

gene. 

6. The method of claim 1, wherein the IRES 
sequence is derived from encephalomyocarditis virus. 

10 • 

7. The method of claim 1, wherein the 
polypeptide encoded by the third nucleic acid element 
is a viral protein. 

15 .8. The method of claim 7, wherein the viral 

protein is an hepatitis virus protein. 

9. The method of claim 8, wherein the 
hepatitis protein is an hepatitis B virus (HBV) 

20 protein . 

10. The method of claim 9, wherein the HBV 
protein is HBV preS2 . 

25 11. The method of claim 1 , wherein the 

third, nucleic acid element is a cDNA library. 

12. The method of claim 1,, wherein the 
third nucleic acid element is a PCR product. 

30 

13. The method of claim 1, wherein the 
third nucleic acid element is from a DNA virus. 

14.. The method, of claim 1, wherein the gene 
35 of the third nucleic acid element is bounded on its 5' 
end with a nucleotide sequence of about 3 to about 150 



-39- 



WO 99/53046 



PCT/US99/08069 



nucleotides complimentary to the 3' end of the first 
nucleic acid element, and bounded on its 3' end with a 
nucleotide sequence of about 3 to about 150 
nucleotides, complimentary to the 5' end of the second 
5 nucleic acid , element . 

15. A method of expressing at least one 
recombinant polypeptide comprising: 

(a) providing as separate elements (i) a 

10 first nucleic acid element which comprises a promoter, 
(ii) a second nucleic acid element which comprises at 
least one selectable marker gene selected from the 
group consisting of a neo r gene and a dhfr gene, an 
encephalomyocarditis virus (EMCV) internal ribosome 

15 entry site (IRES) . sequence positioned upstream of the 
selectable marker gene, and a transcription 
termination sequence positioned downstream of the 
selectable marker gene, and (iii) a third nucleic acid 
element which comprises at least one gene encoding 

2 0 said at least one polypeptide, wherein the gene is 
bounded on its 5' end with a nucleotide sequence of 
about 3 to about 150 nucleotides complimentary to the 
3' end of the first nucleic acid element, and bounded 
on its 3' end with a nucleotide sequence of about 3 to 

25 about 150 nucleotides, complimentary to the 5' end of 
the second nucleic acid element; 

(b) cotransf ecting a population of mammalian 
cells with each of said first, second and third 
elements 

30 (c) culturing said population of cells under 

conditions whereby the polypeptide and the at least 
one selectable marker are expressed; 

(d) selecting cells which express the 
selectable marker; and 
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(e) identifying those cells from the 
selected cells that express the recombinant 
polypeptide. 

5 .16. The method of claim 15, wherein the 

promoter is the human cytomegalovirus (CMV) immediate - 
early enhancer/promoter and includes the CMV intron A. 

17. The method of claim 15, wherein the 

10 second nucleic acid element comprises both a neo r gene 
and a dhfr gene. 

18. The method of claim 17, wherein the neo r 
gene is positioned downstream of the dhfr gene. 

15 

19. The method of claim 18, wherein the 
stop codon for the dhfr gene is replaced with a spacer 
of about 3 to about 150 nucleotides,, such that 
transcription and translation of both the dhfr and ned r 

2 0 genes occurs. 

20. The method of claim 19, wherein the 
spacer is about 12 to about 18 nucleotides. 

25 21. -The method of claim 15, wherein the neo r 

gene confers reduced resistance to geneticin on a cell 
transfected therewith as compared to the wild-type neo r 
gene . 

■ 

30 22. The method of claim 15, wherein the 

polypeptide encoded by the third nucleic acid element 
is a viral protein. 

23. The method of . claim 22, wherein the 
35 viral protein. is an hepatitis virus protein. 
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24. The method of claim 23, wherein the 
hepatitis protein is an hepatitis B virus (HBV) 
protein. 

5 25. The method of claim 24, wherein the HBV 

protein is HBV preS2 . , 

26. The method of claim 15, wherein the 
third nucleic acid element is a cDNA library. 

10 

27. The method of claim 15, wherein the 
third nucleic acid element is a PCR product. 

28. The method of claim 15, wherein the 
15 ' third nucleic acid element is from a DNA virus. 

29. A selectable marker nucleic acid 
element comprising: (a) a neo r gene, (b) a dhfr gene 
positioned upstream of the neo r gene, wherein the stop 

20* codon for the dhfr gene is replaced with a spacer of 
about 3 to about 15 0 nucleotides, such that 
transcription and translation of both the dhfr and neo r 
genes occurs, (c) an internal ribosome entry site 
(IRES) sequence positioned upstream of the dhfr gene, 

25 (d) and a transcription termination sequence 
positioned downstream of the neo r gene. 

30. - The selectable marker element of claim 
29, wherein the neo r gene confers reduced resistance to 

3 0 geneticin on a cell transfected therewith as compared 
to the wild- type neo r gene. 

• 31. The selectable marker element of claim 

29, wherein the IRES sequence is derived from 
35 encephalomyocarditis virus. 
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32. The selectable marker element of claim 
29, wherein the transcription termination sequence is 
a bovine growth hormone transcription termination 
sequence. 

.5 

33. The selectable marker element of claim 
29, wherein the spacer is about 12 to about 18 
nucleotides. 

10 34 . A selectable marker nucleic acid 

element comprising: (a) a neo r gene which confers 
reduced resistance to geneticin on a cell transfected 
therewith as compared to the wild-type neo r gene, (b) a 
dhfr gene positioned upstream of the neo r gene, wherein 

15 the stop codon for the dhfr gene is replaced with a 

spacer of about 12 to about 18 nucleotides, such that 
transcription and translation of both the dhfr and neo 
genes occurs, (c) an encephalomyocarditis virus 
internal ribosome entry site (IRES) sequence 

20 positioned upstream of the dhfr gene, (d) and a bovine 
growth hormone transcription termination sequence 
positioned downstream of the neo r gene. 

35. A mammalian host cell transfected with 
25 the selectable marker nucleic acid element of claim 

29. 

36. A mammalian host cell transfected with 
the selectable marker nucleic acid element of claim 

30 34. 
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